Document Image Quality: Making Fine Discriminations

نویسنده

  • Henry S. Baird
چکیده

We estimate, using synthetically generated images, the smallest changes in document image quality that can be distinguished reliably and fully automatically by Kanungo's bootstrapping test Kan96]. Six parameters of a physics{based document{image degradation model Bai92] are varied, one at a time: for each, over a range of parameter-value diierences, two sets of synthetic images are generated pseudorandomly and the two sets tested for statistical equivalence using Kanungo's method. The rate at which Kanungo's method rejects the hypothesis that the two sets are drawn from the same distribution is analyzed as a function of parameter diierence (a specialized \power function"). The nest discriminations aaorded by the method are given by the width of the power function at a low xed reject threshold. The data show that remarkably ne discriminations are possible { often subtler than are evident to visual inspection { for all six parameters. As few as 25 reference images are suucient. These results suggest that Kanungo's method is suuciently sensitive to a wide range of physics{based image degradations to serve as an engineering foundation for many image{quality estimation and OCR engineering purposes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...

متن کامل

Determining the resolution of scanned document images

Abstract Given the existence of digital scanners, printers and fax machines, documents can undergo a history of sequential reproductions. One of the most important determiners of the quality of the resulting image is the set of underlying resolutions at which the images were scanned and binarized. In particular, a low resolution scan produces a noticeable degradation of image quality, and produ...

متن کامل

Integration of sensory evidence in motion discrimination.

To make perceptual judgments, the brain must decode the responses of sensory cortical neurons. The direction of visual motion is represented by the activity of direction-selective neurons. Because these neurons are often broadly tuned and their responses are inherently variable, the brain must appropriately integrate their responses to infer the direction of motion reliably. The optimal integra...

متن کامل

Learning Document Image Features With SqueezeNet Convolutional Neural Network

The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...

متن کامل

Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)

Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999